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A  strategy  is  provided  for  using  constrained  versions 
of  the  MDI  (minimum  discrimination  information)  statistic  to  test 
and  estimate  market  relations  involving  composite  hypotheses.  An 
algorithm  for  applying  the  tests  and  effecting  the  estimates  is 
also  provided  along  with  numerical  illustrations.  Other,  more 
general,  developments  in  statistics  and  mathematical  programming 
(duality)  theories  and  methods  are  also  briefly  discussed  for 
their  possible  bearing  on  further  uses  in  marketing  research  and 
management., 
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1.  INTRODUCTION* 


This  paper  centers  on  a  development  of  constrained  information 
theoretic  statistics1  with  accompanying  algorithms  and  illustration  for 
use  in  marketing.  It  is  pointed  toward  composite  hypothesis  testing,  as  in 
a  market  segmentation  analysis  with  explicitly  stated  constraints.  Sta¬ 
tistical  testing  --  in  multi-way  contingency  table  analyses,  for  instance  — 
is  not  usually  undertaken  with  explicitly  stated  arrays  of  constraints. 

The  recently  published  book,  The  Information  in  Contingency  Tables  by 

p 

Gokhale  and  Kullback  ,  however,  provides  requisite  statistical  methods  and 
rationales  for  such  treatments.  This  'n  turn,  opens  the  way  for  contact 
with  other  parts  of  the  management  sci..  '.es  where  complex  arrays  of  con¬ 
straints  are  often  used  to  reflect  a  variety  of  "policy"  and/or  "data" 
conditions. 

Our  illustrations  will  be  effected  from  the  data  published  in  [20]. 
Ehrenberg  and  Goodhardt  used  these  data  to  show  that  (a)  the  Hendry  Brand 
Switching  Coefficient  approach  to  market  segmentation  does  not  yield  very 
good  estimates  of  brand  switching  behavior  and  (b)  this  market  —  like  most 
markets  --  is  unsegmented  (i.e.,  it  consists  of  a  single  segment). 

*The  authors  are  indebted  to  D.G.  Morrison  for  editorial  sugges¬ 
tions  that  have  helped  to  improve  this  revision  from  an  earlier  draft. 

^.e.,  we  are  referring  to  the  Kullback-Leibler  statistic  as 
found  in  Kullback  [31]  rather  than  the  probabilistic  or  deterministic 
approaches  built  around  the  Shannon-Wiener  measure  of  information  as 
treated  in,  say,  Theil  [39]  and  [40]. 

2See  [23], 

2In  a  private  communication  from  G.  H.  Goodhardt  dated  September  27, 
1979,  he  states  that  it  has  been  their  experience  that  most  markets  can  be 
adequately  described  as  being  unsegmented. 
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They  do  not,  however,  submit  their  results  to  statistical  tests. 

We  shall  show  how  the  Ehrenberg-Goodhardt  results  can  be  sta¬ 
tistically  tested  relative  to  other  alternatives  by  reference  to  their  data 
but  we  do  not  propose  to  enter  into  yet  another  discussion  of  the  Hendry 
approach  to  market  segmentation.1  Hence,  we  shall  treat  their  data  as 
simply  a  numerical  illustration  for  the  use  of  the  models  and  methods  we 
shall  supply.  Approaching  these  data  in  this  manner  also  accords  with  the 
fact  that  we  do  not  know  the  sample  size  or  conditions  of  selection.  We 
also  do  not  know  the  source  of  the  data  and, hence,  we  shall  approach  these 
data  in  a  way  that  can  accord  with  either  panel  or  survey  collections  of 
such  information  from  respondents.  For  similar  reasons,  we  shall  steer 
clear  of  full-scale  segmentation  analysis  by  reference  to  product  benefits, 
customer  (demographic)  characteristics,  etc.,  since  (a)  this  accords 
most  closely  with  the  Ehrenberg-Goodhardt  results  and  (b)  the  requisite 
data  are  not  available  for  these  purposes  anyway.  The  following  develop¬ 
ment  thus  provides  only  a  beginning  but,  as  will  be  seen,  it  is  especially 
well -suited  for  statistically  testing  "nested  hypotheses"  such  as  are 
implicit  In  market  segmentation  studies.  As  we  shall  also  note  (and  illus¬ 
trate),  however,  these  constrained  information  theoretic  models  and  methods 

possess  statistical  estimation  as  well  as  hypothesis  testing  properties 
that  may  be  simultaneously  exploited. 


^ee,  e.g.,  the  critique  by  Ehrenberg  and  Goodhardt  [21]  of 
the  presentation  by  Kalwani  and  Morrison  [30]. 
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2.  BACKGROUND 


Marketing  professionals  are  familiar  with  the  use  of  two-way 
contingency  tables  (cross-tabulations) .  Formally,  such  two-way  analyses 
can  be  extended  to  multi-way  contingency  tables,  but  then  one  confronts 
a  variety  of  inadequacies  in  classically  available  statistical  methods. 

The  unsatisfactory  nature  of  our  ability  to  deal  with  large  multi-way  con¬ 
tingency  tables  has  begun  to  give  way  before  many  different  separate  devel¬ 
opments  in  the  statistics  (and  computing  sciences)  literature.  The  re¬ 
sulting  proliferation  of  methods  has,  in  turn,  given  rise  to  a  need  for 
systematization  accompanied  by  a  unifying  rationale  based  on  methodologi¬ 
cal  as  well  as  conceptual  grounds.  The  recently  published  book,  Discrete 
Multivariate  Analysis,  by  Bishop,  Fienberg  and  Holland  [5]  explicitly1 
acknowledges  that  their  effort  was  undertaken  in  response  to  this  need. 

These  authors  use  the  “log-linear  model"  as  a  basis  for  "codifying"  approaches 
that  might  be  employed.  The  approach  via  log-linear  models  falls  short  of 
what  is  required,  however,  to  supply  the  unifying  rationale  that  is  needed. 
This  is  supplied  by  Gokhale  and  Kullback  in  [23]  who  use  the  MDI  statistic 

for  this  purpose  and  who  show  how  the  log  linear  model  itself  can  be  de- 

2 

rived  from  this  statistic  along  with  a  variety  of  other  approaches.  They 
also  show  how  extensions  beyond  any  of  these  other  approaches  --  e.g.,  in¬ 
cluding  explicitly  adding  constraints  beyond  those  of  the  contingency 
tables  —  can  be  effected  by  following  along  these  "information  theoretic" 

1 i nes . 


^p.  1-3. 
2Pp.  38-39. 
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The  history  of  the  development  of  the  MDI  statistic  is  a  rather 
curious  one,  at  least  as  far  as  the  American  literature  of  statistics  is 
concerned.  It  derives  from  the  so-called  "Kullback-Leibler  statistic" 
which  is  of  the  form 

(1.1)  I  ( p : 7T )  =  2  p.-  log 

1=1  1  ffi 


where  p  and  tt  are  vectors  with  components 

Pi,  *1  i  0 

n  n 

(1.2)  Z  p.  »  E  tt.  =  1  . 

i  =1  1  i =1  1 

Here  the  tt i  may  represent  a  set  of  hypothesized  (constant)  values  which 
are  to  be  tested  relative  to  the  p^  (variable)  values  estimated  from  ob¬ 
served  data.  The  tt..  may  also  represent  a  set  of  prior  probabilities  as 
in  Bayesian  decision  theory  and  then  the  p..  become  posterior  probabilities 
determined  from  sample  observations.  Other  interpretations  are  also  pos¬ 
sible  which  accord  with  different  ways  of  chosing  the  p^ .  When  the 
components  of  p  represent  minimizing  values,  p*,  the  expression  in  (1.1) 
may  be  replaced  by 

n  p.*  n  p. 

(2)  I  ( p* : tt)  =  Z  p4*  log  =  min  I  p.  log  —  , 

i  =1  1  77  i  p  1  =  1  1  tt. 


and  this  is  called  the  MDI  statistic. 
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The  expression  in  (1.1)  is  often  confused  with  the  Shannon-Wiener 
measure  of  information.  The  latter,  however,  represents  a  probabilistic 
rather  than  a  statistical  concept.  In  fact,  as  Kullback  notes  in  his 
book.  Information  Theory  and  Statistics*,  L.  0.  Savage,  one  of  the  founders 
of  modern  decision  theory,  thought  of  it  only  in  these  terms  when  he  re¬ 
marked,  "The  ideas  of  Shannon  and  Wiener,  though  concerned  with  probability, 

seem  rather  far  from  statistics."  Kullback,  however,  goes  on  to  show  how 

o 

both  (1.1)  and  (2)  can  be  given  a  statistical  characterization  and  used 
to  unify  an  extremely  wide  variety  of  statistical  concepts  and  developments. 
Kullback's  contributions  are  appropriately  acknowledged  in  the 
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literature  which  refers  to  (1.1)  as  the  "Kullback-Leibler  statistic." 

He  is  also  the  author  of  the  name  Miniumum  Discrimination  Information  (MDI) 
statistic  which  name  may  be  explained  in  the  following  way:  Consider  any 
estimate  of  p  in  (1.1)  which  yields  a  distribution  that  significantly 
differs  from  the  distribution  as  hypothesized  in  n.  If  the  p  distribution 
differs  significantly  from  the  hypothesized  tt  distribution  then  a  question 
may  remain  as  to  whether  some  other  estimate  of  p  (also  consistent  with  the 
cbta)  might  fail  to  exhibit  significance.  The  matter  is  resolved,  however,  if 
p  is  chosen  "as  close  as  possible"  to  it.  This,  then,  is  the  meaning  of  mini¬ 
mum  discrimination,  i.e.,  p*  provides  minimum  discrimination  against  the 
hypothesized  it  that  the  data  together  with  any  other  constraints  will  allow. 


*[31]  p.2. 

2Also  called  the  "Khinchin-Kullback-Leibler  statistic"  because  of 
earlier  (but  simpler)  contributions  of  A.  I.  Khinchin  in  his  book  on  the 
Mathematical  Foundations  of  Statistical  Mechanics  (New  York:  Dover  Publica- 
tions,  1<539).  See  [17], 

3See  [25]. 


The  additional  constraints  allowed  in  Gokhale-Kullback  [23]  are 
of  the  form 

(3)  j  ai  jpj  =  6i  »  1  =  1 »  2 ,  ...»  m  . 

When  the  6i  are  derived  from  the  data,  as  in  the  case  of  marginal  totals 
for  a  contingency  table,  then  the  constraints  are  said  to  be  "internal 
constraints".  When  they  are  imposed  on  the  basis  of  various  hypothesized 
premises,  as  in,  for  instance,  an  assumed  segmentation,  then  the  con¬ 
straints  are  said  to  be  "external  constraints."  It  is  the  latter  which 
will  be  emphasized  here. 

Notice  the  flexibility  that  is  allowed  by  reference  to  the  pos¬ 
sibility  of  testing  hypotheses  with  the  tt  values  in  the  functional  or  the 
G  values  in  the  constraints.  Choice  of  the  minimizing  p*  in  terms  of 
(1.1),  (2)  and  (3)  also  suggests  the  possibility  of  contact  with  mathemati¬ 
cal  programming  with  its  great  computational  power  and  the  variety  of  inter 
pretations  that  are  available  from  the  underlying  duality  relations.  These 
prospects,  too,  have  now  been  formally  comprehended  as  in  [ 10] 1  with  the 
result  that  an  unusually  simple  (unconstrained)  convex  programming  problem 
is  found  to  be  the  dual  to  minimizing  (1.1)  subject  to  (1.2)  and  (3)  , 
which  dual  is 

n  n 

(4)  Max  i:  e.z.  -  i  TT.  e x p ( >:  z.a..) 

z  i=0  1  1  j=i  J  i  1  1J 

^See  also  [17]. 

2 

It  is  perhaps  of  interest  to  note  that  (1.1)  provides  a  "proper 
goal  functional"  in  the  sense  of  [11]  for  use  in  goal  programming. 
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where  eQ  =  1,  aQj  =  1,  j  =  1,  ...n  and  "exp"  denotes  the  exponential, 
e  =  2.718....  See  [10]  and  [17],  and  observe  that  the  choices  of  the  com¬ 
ponents  in  z  are  not  constrained. 

One  way  to  approach  the  need  for  the  kind  of  general  assessment 
we  are  making  is  to  say  that  the  literature  of  marketing  research  reflects 
the  fact  that  the  American-English  statistics  literature  has  remained 
relatively  quiescent  on  the  subject  at  least  as  far  as  applications  are  con¬ 
cerned  since  the  appearance  of  Kull back's  book.  This  has  not,  however, 
been  true  of  other  statistics  literatures.  Both  Soviet  and  Japanese  stat¬ 
isticians  have  continued  to  push  forward  vigorously  along  the  paths  opened 
by  Kullback  in  his  statistical  characterizations1.  The  Japanese  statisti¬ 
cian,  H.  Akaike,  for  example,  presented  a  very  important  paper  at  a  Soviet 
sponsored  conference  in  which  he  showed  how  maximum  likelihood  estimation 
(the  heart  of  classical  statistics)  could  be  given  a  formulation  asymp¬ 
totically  equivalent  to  MDI.  The  MDI  method  has  the  unique  characteristic 
of  providing  both  hypothesis  testing  and  statistical  estimation  regardless 
of  the  conclusion  of  the  test.  It  thus  provides  a  decision  theoretic 
method  which  unifies  hypothesis  testing  and  estimation. 

Regression  (sometimes  of  a  logit  or  probit  variety)  and  factor 
analysis  are  often  used  techniques  in  marketing  research.  Current  statis¬ 
tical  procedures  usually  determine  the  number  of  terms  to  be  used  on  a 
trial  and  error  --  or  exhaustive  sequential  --basis.  Akaike  shows  in  [1], 


*The  economist,  H.  Theil,  has  also  pushed  forward  vigorously,  but 
largely  along  lines  that  are  either  deterministic  or  probabilistic  in 
character,  to  show  how  an  extremely  wide  variety  of  social  and  economic 
problems  might  be  addressed.  See  [39]  and  [40]. 

^See  [1]. 
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however,  that  the  MDI  approach  immediately  determines  the  number  of  terms 
to  be  used  from  the  sample  data  by  the  MDI  decision  theoretic  criterion 
alone. 1 

There  are  other  advantages  that  can  also  be  secured.  For  example, 
as  we  have  elsewhere  shown  [16],  MDI  can  provide  a  basis  for  unifying  a 
great  variety  of  seemingly  separate  (and  different)  approaches  to  marketing 
research.  Here  we  want  to  show  how  to  bring  it  to  bear  on  problems  such 
as  the  composite  hypothesis  testing  required  for  market  segmentation  testing. 


3.  A  SEGMENTATION  THEOREM  AND  ALGORITHM 

Turning  to  the  marketing  literature,  only  a  few  example  appli- 

2 

cations  are  available  and  these  generally  take  the  form  of  the  simpler 
"entropy"  formula 

n 

(5)  T.  p,  log  p. 

i  =1  1  1 


applied  to  areas  like  brand  switching  or  mar’.et  segmentation. 

Examples  of  uses  of  the  entropy  concept  in  the  marketing  litera- 
ture  may  be  found  in  the  articles  by  Hernite.1’  [27],  [28]  and  by  Bass  [4]  . 

*Akaike  also  brought  these  methods  to  bear  on  the  so-called 
"James-Stein  paradox"  wherein  by  the  use  of  seemingly  irrelevant  data  one 
can  secure  improved  estimates  which  are  not  only  more  efficient  than  the 
mean  (a  maximum  likelihood  estimator)  but  which  also  even  eliminate  the  mean 
from  the  admissible  class  of  estimators  in  a  decision  theoretic  sense.  See 
[2]  and  [3]  where  Akaike  also  discusses  the  inadequacies  of  Bayesian  approaches 
to  this  topic.  (An  amusing  and  insightful  article  on  the  Oames-Stein  paradox 
may  be  found  in  [18]  and  a  more  general  treatment  of  the  deficiencies  of 
maximum  likelihood  estimators  may  found  in  [41].) 

2 

Theil  [39]  and  [40]  provides  numerous  examples  in  which  expressions 
(1.1)  and  (2)  are  used  but,  as  previously  noted,  these  are  generally  accorded 
deterministic  or  probabilistic  interpretations  and  treatments. 

^See,  also  Carter  [9]. 
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Variants  and  alternatives  to  it  may  be  found  in  the  article  by  Kalwani  and 
Morrison  [30]  as  well  as  in  other  articles  dealing  with  the  Hendry  approach 
to  brand  switching  and  market  segmentation  analysis.  See,  e.g.,  [8] 
and  [26]. 

We  elect  to  make  contact  with  this  part  of  the  marketing  literature, 
but  without  entering  into  a  full-scale  discussion  of  the  Hendry  approach 
to  segmentation.  An  easy  way  to  do  this  is  via  the  data  of  Table  1  which 
Ehrenberg  and  Goodhardt  [19]  used  to  test  the  Hendry  approach  and  concluded 
that,  on  the  evidence,  this  market  does  not  exhibit  any  segmentation  at  all. 

In  arriving  at  this  conclusion,  Ehrenberg  and  Goodhardt  use  only 
a  first-order  analysis  of  the  market  shares.  Within  a  Hendry  type  analysis 
one  must  check  to  insure  that  the  switching  constants  derived  from  these 
ratios  "is  applicable  to  the  total  product  class  as  well  as  to  the  in¬ 
dividual  brands  within  a  product  class."1  More  generally  one  might  com¬ 
mence  with  the  (relative)  market  share  shown  for  each  product,  and  then 
go  on  to  consider  pairs,  followed  by  triplets,  and  so  on,  to  the  2n- 1=127 
possibilities  for  the  example  shown  in  Table  1.  The  idea  is  to  check  to 
see  whether  the  resulting  ratios  are  approximately  the  same  in  order  to 
avoid  (or  reduce)  the  danger  of  reaching  erroneous  conclusions  from  the 
(perhaps  accidental)  equality  of  lower  order  ratios. 

Actually  Ehrenberg  and  Goodhardt  use  only  a  first  order  analysis. 
That  is,  they  do  not  undertake  any  of  these  higher  order  calculations,  but 
their  approach  is  nonetheless  satisfactory  by  virtue  of  the  following 
theorem: 


Quoted  from  Kalwani  and  Morrison  [30],  p.  420.  See  also  p.  473. 


. . t 


Theorem: 


2  _  _  nn 

•••  'r« 

T  -A 

for  ill  ke  {1,  ....  n}  and  all 
seS  s  SC{1,  ....  n}, 

where  A.,  are  real  numbers  with  all  /  0.  In  other  words,  when  the 

first  order  ratios  are  equal  then  all  the  higher  order  ratios,  however 

they  may  be  formed,  will  also  be  equal. 

This  theorem,  which  we  have  proved  elsewhere*,  constitutes  the 

first  part  of  our  proposed  algorithm  and  enables  us  to  follow  Ehrenberg 

and  Goodhart  in  restricting  ourselves  to  the  simple  situtation  of  only 

first  order  calculations  in  the  use  of  formula  (6),  below.  We  do  not 

follow  Ehrenberg  and  Goodhardt  into  an  examination  of  the  "entropy  function" 

2 

variants  which  enter  into  the  Hendry  approaches  for  computing  such  ratios  . 

We  simply  observe  that  the  above  theorem  holds  without  reference  to  the 

way  the  A  and  B  values  are  obtained  and  then  pass  on  to  an  initial 
s  s 

segmentation  via 

Pi  .  p(1.D 

Ri  =  pToT^T 

(6)  1 

where  p..  a  market  share  for  brand  i 

p(i,i)  =  proportion  making  repeat  purchases  of 

brand  i. 


If 


‘1  _ 


Then 


‘k 


*See  [14]. 

"The  use  of  these  entropy  functions  in  Hendry  analyses  are  dis¬ 
cussed  in  Kalwani  and  Morrison  [30],  who  apparently  believe  they  can  be 
dispensed  with,  and  also  by  Ehrenberg  and  Goodhardt  [21]  who  believe  that 
they  are  a  distinguishing  feature  of  Hendry  type  analyses. 
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Before  applying  this  formula  to  the  data  of  Table  1,  below,  we 
reiterate  that  we  are  not  aware  of  the  way  the  data  In  Table  1  were 
obtained.  Neither  all  rows  nor  all  columns  sum  to  unity  —  see  Table  2  — 
so  that  evidently  the  matrix  is  not  Markovian.  We  could,  of  course, 
induce  this  property  by  dividing  through  by  the  row  or  column  rim  values. 
Ehrenberg  and  Goodhardt  do  not  use  the  table  in  this  "Markovian"  fashion 
in  [20], however,  and  the  fact  that  Ehrenberg  has  been  openly  critical  of 
the  use  of  first-order  Markov  processes  in  describing  consumer  behavior* 
disinclines  us  to  this  approach.  We  shall  therefore  interpret  the  p(i,i) 
as  joint  rather  than  conditional  probabilities. 

Applying  (6)  to  the  data  of  Table  1  we  obtain  the  following 

results 


Rj  =  0.660  R4  *  0.719 

(7)  R2  =  0.656  R5  *  0.750 

R3  =  0.723  Rg  =  0.704 


Using  the  above  theorem  we  then  assert  that  i  and  j  are  in  the  same 
classification^hen  R.  =  R,  .  Following  Ehrenberg  and  Goodhardt  [20],  we 

*  J 

have  used  the  average  of  the  row  and  column  marginal  values  in  arriving 
at  these  results.  This  assumes  "market  share  equi^ibrium",,  between  the 


*See  [19]  including  his  rejoinder  to  the  Massey  and  Morrison 
critique  noted  there. 

2 

The  approximation  indicated  by  is  made  clear  in  (7)  and  (8) 
and  subsequently  subjected  to  statistical  tests.  A  general  (more  precise) 
statement  of  this  approach  is  formalized  as  the  SSI  theorem  in  [14j. 

3 See  [20]. 


c yn^i  a  u 


12 


two  periods,  a  hypothesis  that  is  subject  to  test  along  lines  that  we 
shall  indicate.  This,  however,  is  not  the  only  possible  equilibrium  that 
might  be  of  interest  for  marketing  purposes.  For  instance,  we  shall  in¬ 
troduce  the  idea  of  "switching  equilibrium"  and  show  how  this,  too,  may  be 
tested. 

Ehrenberg  and  Goodhardt  conclude  from  their  analyses  in  [20]  that 
this  market  does  not  segment  into  different  classes.  I.e.,  they  conclude 
that  thtit  ;•  only  one  "homogeneous"  market  in  which  all  products  compete.* 
They  do  not  test  this  hypothesis  by  statistical  methods,  however,  as 
we  shall  do.  This  will  be  done  as  a  byproduct  of  our  use  of  a  series  of 
composite  hypothesis  testing  procedures  with  an  accompanying  algorithm 
starting  with  the  following  three  different  classes  (or  segments)  which 
our  rule  of  approximate  equality  (R.  =  R.)  provisionally  suggests. 

*  J 

Ij  »  {1.2} 

(8)  I2  =  {3,4,6} 

h  =  {5} 

TABLE  1 

The  Observed  Bra nd_- S w itching  Percentages 
for  Six  Brands  of  Breakfast  Cereals 

(Two  successive  purchases  per  consumer) 

_ Second  Purchase  All 

Product  First ^Purchase  1 _ _ Z _ 3 _ a  5. _ 6_ 


Corn  Flakes 

(23. a) 

7.7 

3.3 

2.0 

1.3 

1.4 

39.5 

Wee tab ix 

6.4 

04.3) 

3.3 

1.1 

.8 

1.1 

27.0 

Shredded  Wheat 

3.6 

3.2 

(5.4) 

.8 

.8 

.7 

M.5 

Sugar  Puffs 

3.2 

1.5 

1.1 

(3.1) 

.8 

.3 

10.0 

Puffed  Wheat 

1.7 

.6 

.4 

.6 

(1.5) 

.1 

4.9 

Brand  X 

1.0 

.4 

.6 

.3 

.3 

(1.5) 

4.1 

All  Buyers 

39.7 

27.7 

14.1 

7.9 

5.5 

5.1 

100.0 

1 


See  Ehrenberg  and  Goodhart  [20]. 


■  >i'-.  ^!au,i  -  ■ 


4.  VULNERABILITY  RATIOS 


These  R.  values  are  sometimes  referred  to  as  "switching  constants" 
—  e.g.,  in  a  Hendry  type  analysis.  We  prefer  the  term  "vulnerability 
ratios",  however,  so  that  the  above  development  produces  classes  of 
equally  vulnerable  products. 

Justification  for  this  change  in  terminology  my  be  presented  in 
terms  of  (6)  by  rewriting  it  in  the  following  "verbalized"  form: 


(9) 


.  repeat  purchase  rate  for  i 
_  1  "  brand  share  for  1 

1  1  -  brand  share  for  i 


It  is,  as  can  be  seen,  a  consumer  outflow  measure  for  brand  I.1  Without 
allowing  for  inflows  it  becomes  a  measure  of  brand  i's  vulnerability  to 
such  outflows.  Hence,  our  characterization  of  it  as  a  "vulnerability 
ratio." 

The  smaller  the  value  of  this  ratio,  the  more  stable  (or 
invulnerable)  is  the  brand  in  the  sense  that  smaller  R..  reflect  an  in¬ 
creasing  proportion  of  repeat  buyers  In  the  brand's  present  market  share. 
For  instance,  if  R.  =0,  then  the  brand  share  for  1  *  the  repeat  pur¬ 
chase  rate  for  i,  and  all_  of  this  brand's  customers  are  repeat  buyers. 


XTh1 s  would  be  true  even  if  the  p(i,i)  were  Interpreted  as 
conditional  probabilities,  as  in  a  Markoff  type  approach,  rather  than 
the  joint  probability  Interpretation  which  we  are  using. 
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We  are  keeping  these  interpretations  general  and  restricting  our 
terminology  accordingly.  For  instance,  we  do  not  assess  the  reasons  for 
the  observed  behavior  and  we  allow  for  a  variety  of  assumptions  (e.g,,  of  a 
probabilistic  character)  which  may  accord  with  more  specialized  statistical 
distributions  used  by  others.  As  a  case  in  point  we  may  cite  Sabavala 
and  Morrison  [3£]  who  use  a  Beta-Binomial  development  to  obtain  what  they 
refer  to  as  a  "Loyalty  Index"  for  viewing  TV  programs.*  In  this  case 
more  special  (and  precise)  characterizations  are  possible  since  the  condi¬ 
tion  R.  a  0  then  also  implies  a  very  bipolar  population  in  which  consumers 
all  have  p  values  near  0  or  1  for  any  such  brand. 

Results  like  the  above  yield  additional  hypotheses  for  testing 
—  i.e.,  situations  particularized  further  below  the  general  situation 
p^  =  p...  which  applies  in  our  case  —  which,  of  course,  is  consistent  not 
only  with  high  or  low  p..  values  but  with  intermediate  values  as  well. 

The  use  of  specializing  assumptions  may  also  restrict  the  range 
of  possible  hypotheses  on  a  priori  grounds.  For  instance,  an  assumed 
Beta-Binomial  will  also  restrict  the  range  of  R^  values  to  0  <  R..  <  1 
whereas  our  more  general  situation  admits  of  values  Ri  >  1.  Within 
the  range  of  such  restrictions,  then,  we  may  regard  the  Sabavala-Morrison 
"Loyalty  Index"  as  the  complement  our  "Vulnerability  Ratio". 


*We  are  indebted  to  D.G.  Morrison  for  pointing  out  the  additional 
possibilities  and  relations  derivable  from  this  article. 


In  general,  we  may  think  of  brands  with  0  £  R.  <  1  as  being 
relatively  invulnerable.  At  R..  =  1  we  have 

(10)  p(i,1)  -  Pj2  . 

If  we  interpret  these  ratios  in  terns  of  probabilities,  then  we  can  say  that 

at  Rj  =  1  the  repeat  purchases  are  statistically  independent  identically 
•  o 

distributed  events.  Finally,  at  values  R..  >  1  we  would  have  p.  >  p(1,i) 

in  which  case  repeat  buying  would  be  even  worse  than  random  and  the  brand 

highly  vulnerable.1 

This  kind  of  information  can  be  put  to  use  in  developing  market 
strategies.  For  instance,  it  directs  attention  to  the  possibilities  of 
attracting  customers  from  brands  with  R^  values  exceeding  one's  own.  Con¬ 
versely,  it  warns  of  possible  inroads  from  brands  with  smaller  values, 
which  Inroads  may  then  have  some  staying  power  that  increases,  e.g., 
because  of  repeat  purchasing  propensities. 

Differences  in  observed  R^  values  may  not  be  statistically  sig¬ 
nificant,  in  which  case  the  above  interpretations  will  need  to  be  modified 
to  accord  with  this  fact.  Testing  for  statistical  significance  of  these 
and  other  hypotheses  is  a  task  to  which  we  shall  now  turn,  but  our  focus 
on  this  aspect  of  testing  does  not  mean  that  we  believe  that  other  consid¬ 
erations  such  as,  e.g.,  application  of  marketing  Insight,  are  unimportant. 

5.  TYPICAL  FORMS 

We  will  use  specializations  of  (1.1)  and  (2)  and  (3)  to  illustrate 
our  procedures  for  testing  the  structure  of  a  market  in  the  form  of  a  series 
of  "nested  hypotheses"  starting  with  (8).  We  now  Introduce  a  double  sub¬ 
script  for  the  discrete  probability  distributions  with  which  we  are  concerned. 

2 

The  model  is 

^ne  would  generally  expect  such  situations  to  be  rare  and  fairly 
short-lived  —  although  one  could,  clearly,  arrange  to  have  a  product  and 
accompanying  market  strategies  which  would  produce  such  behavior  almost  at  will. 

2The  well-known  SUMT  program  of  Flacc.o  and  McCormick  [22]  Is 
available  for  solving  this  kind  of  problem  via  its  dual. 
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(11) 


where 


p.  ■ 

Minimize  I(p  =  z  p.,  In  -JJ- 

U  U  1 ,j  iJ  "ij 


subject  to 


’V/U-  Pij  i  0.  Vi.J, 

I  i%J 

0  -  R.  -  Rj  ,  etc., 


(12) 


Ri  - 


R.  * 
J 


1  -  p(i * i )/ 

1  ’  Pi 

1  -  P(j»j)/Pj 
1”  5". 


and  the  p^  and  p^  indicate  posited  market  share  values,  i.e.,  these  values 

are  not  to  be  estimated  by  this  minimization  procedure.  Herep(i,i)  =  p^  and  the 

p.  .  represent  proportions  switching  between  products  i  and  j  over  the  two 
^  J 

purchase  occasions. 

The  estimated  R.  *  R.  represent  vulnerability  ratios  hypothesized 

*  J 

to  accord  with  "nestings"  indicated  by  arrangements  like 


A  A 


when  products  i,  j  and  k  are  hypothesized  to  be  in  the  same  vulnerability  class. 

To  simplify  notation  we  omit  the  circumflexes  on  the  R^,  Rj,  R^  and 
designate  optimal  estimates  by 


1  -  piV^1 

1  -  P, 

1  -  pjViij 


04) 
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The  resulting 


(15)  I*  -  I  (p,J  ,  .„) 

may  then  be  used  to  test  the  hypothesis  that  the  minimizing  choices  do 
not  deviate  from  the  distribution  associated  with  the  tt^.  The 
represent  "switching  proportions"  hypothesized  in  our  case  to  accord  with 
the  data  represented  in  Table  2, 

TABLE  2 


K 

1 

2 

3 

4 

5 

6 

i 

.238 

.077 

.033 

.020 

.013 

.014 

2 

.064 

.143 

.033 

.011 

.008 

.011 

3 

.036 

.032 

.054 

.008 

.008 

.007 

4 

.003 

.015 

.011 

.031 

.008 

.003 

5 

.017 

.006 

.004 

.006 

.015 

.001 

6 

.010 

.004 

.006 

.003 

.003 

.015 

where  the  data  of  Table  2  are  the  Table  1  data  expressed  as  fractions. 
The  j  values  are  to  be  secured  by  solving  (11)  using  alj_  of  the  data 
(off-diagonal  as  well  as  diagonal)  in  Table  2. 

A  characterization  which  provides  access  to  the  statistical 
properties  as  developed  in  Kullback  [311  for  the  MDI  statistic  may  be  ob¬ 
tained  from  the  often  made  remark  that  it  has  been  widely  observed  (i.e., 
in  many  markets)  that  brand  switching  Is  proportional  to  brand  share1  and, 

^ee  Ehrenberg  and  Goodhardt  [20] 


18 


with  regarded  as  a  switching  constant,  this  is  reflected  in  (6).  However, 
this  "empirical  law"  may  itself  be  regarded  as  being  implied  by  the  assump¬ 
tion  that  the  underlying  distribution  which  yields  these  switches  is  multi¬ 
nomial  in  character  with  probabilities  ,  i  a  1,2,. ..,n.  It  then  follows 
from  the  multivariate  form  of  the  central  limit  theorem  that  the  observa¬ 
tions  tend  to  the  multi-normal  distribution  with  increasing  sample  size.1 
Thus,  from  these  considerations  alone,  one  may  anticipate  such  an  empirical 
law  directly  from  the  fact  that  the  dispersion  (covariance)  matrix  for  large 
samples  will  be  symmetric  with  off-diagonal  terms  proportional  to  the 
product  of  the  corresponding  brand  shares.  Convarsely,  a  finding  of  sta¬ 
tistically  significant  deviations  will  result  in  rejection  of  the  "empiri¬ 
cal  law"  that  switching  is  proportional  to  brand  share.  Rejection  of  this 
typothesis  can  serve,  in  marketing  terms,  as  an  alert  to  management  that 
the  market  structure  is  changing. 

6.  ALGORITHM  AND  TESTING  PROCEDURE 

The  theorem  that  we  provided  in  Section  3  allowed  us  to  avoid  one 
part  of  the  combinatorial  number  of  possibilities  that  might  be  encountered 
in  a  market  segmentation  analysis.  Another  part  jf  „nese  combinatorial  diffi¬ 
culties  is  encountered  v;nen  we  come  to  effecting  our  stastical  tests  since 

A  A 

one  might  (in  principle)  have  to  consider  a  great  number  of  possible  R^,  R^ 
pairs  to  test  in  (11). 

^or  a  more  detailed  discussion  with  accompanying  further 
references,  see  fl4]. 


-w=i- 


We  can,  of  course,  restrict  our  tests  to  the  already  suggested 
possibilities  in  (8)  when  we  are  confident  that  this  will  not  omit  other 
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possibilities  that  might  also  be  valid.  More  generally,  however,  and 
even  in  the  present  case,  a  greater  variety  of  possibilities  may  need 
to  be  considered.  We  therefore  suggest  the  following  as  a  second  part 
to  the  algorithm  which  we  have  already  provided  via  the  theorem  in 
Section  3. 

This  second  part  of  the  algorithm,  which  is  to  be  used  when  statist!’ 
cal  testing  is  to  occur,  may  be  stated  as  follows: 

Order  the  R^  values  from  smallest  to  greatest  as 
R(l),  r(2)'  *•••  R(n)'  as  Table  3  below.  See  (7).  By 
MDI,  test  the  hypothesis  that  and  R^  are  equal  at  a 
specified  level  of  significance.  If  the  resulting  value  of 
the  MDI  statistic  rejects  the  hypothesized  equality  then  seg¬ 
ment  R^  from  R^2).  Next  begin  with  R^)  as  a  smallest  R. 
to  test  for  a  further  segment  apart  from  R^.  If,  instead, 
the  test  accepts  the  hypothesized  equality  of  R^  and  R^)» 
add  the  additional  condition  to  R^  ■  R^)  B  R(3)*  Test  with 
the  MDI  statistic.  If  the  test  accepts  the  hypothesized  equality 
of  R^and  go  on  to  new  higher  R^  by  adding  the  new  per¬ 
tinent  additional  equality  conditions. 

To  illustrate  our  suggested  procedures  for  using  these  MDI  values, 
we  now  refer  to  the  following  R^  values  from  the  results  portrayed  in  (7), 
but  arranged  as  indicated  by  this  part  of  our  algo  ;thm. 

TABLE  3 

i:  2  1  6  4  3  5 

R.j :  .650 


.660  .704  .719  .723  .750 
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Applying  this  algorithm  we  test  the  hypotheses  indicated  by  the 
nestings  (from  top  to  bottom)  under  the  column  labelled  "Null  Hypothesis" 
in  Table  4.  With  this  formulation,  Ehrenberg  and  Goodhardt  apparently 
are  correct,  or  at  least  their  "one  homogeneous  market"  conclusion  can¬ 
not  be  rejected  on  the  basis  of  these  data  for  all  sample  sizes,  N  _<  6,500 
and  significance  level  a  =  0.95.* 


TABLE  4 


Problem 

Null 

Hypothesis 

I* 

d.f  .# 

N  <  6500 

a  =  0.95 

1 

Rz” 

R1 

Negligible 

1 

accept 

2 

R2 

R1  ‘  R6 

1.24  x  10"4 

2 

accept 

3 

R2  " 

R1  *  R6  =  R4 

3.60  x  10"4 

3 

accept 

4 

V 

R1  =  R6  =  R4  =  R3 

5.94  x  10"4 

4 

accept 

5 

r2  = 

R1  =  R6  *  R4  “  R3  "  R5 

8.54  x  10~4 

5 

accept 

#  See  Gokhale  and  Kullback  [23]  or  [24]. 


There  is,  however,  another  more  stringent  test  of  market  homo¬ 
geneity  based  on  requiring  the  estimated  p^.  to  conform  to  the  conditions 
for  market  share  equilibrium  which  requires  the  adjunction  of  additional 
constraints.  There  are  also  other  kinds  of  market  equilibria  that  might 
need  to  be  considered.  To  create  a  better  perspective  on  some  of  the 

*Ehrenberg  and  Goodhardt  did  not  supply  information  on  sample 
sizes  or  other  statistical  characterizations. 

2 

We  leave  aside  the  more  expedient  computational  and  interpre- 
tational  possibilities  afforded  from  (4)  in  order  to  focus  on  these  more 
immediate  issues. 


the  latter  possibilities  we  shall  therefore  first  consider  how  a  "switching 
equilibrium"  model  might  be  formulated  for  joint  hypothesis  testing  and 
estimation  via 


min  I  ( p : 7T ) 


subject  to 


6  6 

Z  Z  p.  .  =  1 

i=l  j=l  P,J 


pij  =  pji’  Vij»j 


i  0  V  i,j 


The  resulting  estimates  are  portrayed  in  Table  5  with  I*  =  0.007  and  15 
degrees  of  freedom.  The  hypothesis  of  "switching  equilibrium"  is  not  rejected 
for  all  N  <  1430  at  a  *  0.95. 


TABLE  5 

Switching  Equilibrium  Values  ,  i  /  j 


55 

1 

2 

_ 3 

....  4_ 

_ 5 _ 

-  .6 _ 

1 

.240 

.071 

.035 

.025 

.015 

.012 

2 

.071 

.144 

.033 

.013 

.007 

.007 

3 

.035 

.033 

.054 

.009 

.006 

.007 

4 

.025 

.013 

.009 

.031 

.007 

.003 

5 

.015 

.007 

.006 

.007 

.015 

.002 

\  6 

\  .012 

.007 

.007 

.003 

.002 

.015 

1 
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Apparently  the  market  portrayed  in  Table  1  is  consistent  with  the 

* 

hypothesized  switching  equilibrium  and  the  p, ,  values  portrayed  in  Table  5 

*  J 

represent  "best"  estimates  of  the  switching  probabilities  in  the  sense  that 
they  are  "as  close  as  it  is  possible  to  get"  to  the  ir. .  values  in  Table  2 

*  J 

while  retaining  the  symmetry  conditions  for  switching  equilibrium  as 
reflected  in  (16). 


Turning  next  to  the  question  of  market-share  equilibrium  we  now 
formulate  our  model  for  purposes  of  testing  this  model  by  incorporating 
these  conditions  explicitly  as  follows:1 

min  I  (p:ir) 

subject  to 


6 

I  P<<  a  P, 
j=1  U 


(17) 


6 

z 

i*l 


PijiO.  VU 


where  p.  and  p,  represent  the  average  of  the  rim  values  from  Table  2  with  j 

_  1  J  6  _  6  _  I 

p.  *  p.  whenever  i=j  and  E  p.  =  E  p.  =  1.  In  other  words,  we  formalize  the  ! 

1  J  iBl  1  j-1  J  i 

assumption  of  an  hypothesized  market  share  equilibrium  and  then  repeat  the  J 

Compare  with  (11)  and  (12).  Although  Ehrenberg  and  Goodhardt  are  f 

clear  in  their  discussion,  no  such  tests  were  conducted  by  them  —  possibly  | 

because  classical  statistical  mechanisms  are  not  designed  to  deal  with  such  | 

explicitly  constrained  models  involving  "external  constraints"  in  rather  f 

complex  arrays.  I 


same  tests  as  in  (11)  to  ascertain  whether  the  conclusion  of  a  single 
homogeneous  market  is  correct  and  obtain  the  results  shown  in  Table  6. 


TABLE  6 


MARKET  SEGMENTATION  WITH 

MARKET  SHARE  EQUILIBRIUM 

Problem 

Null  Hypothesis 

I* 

d.f . 

N-3750 
a=0. 95 

1 

R1=R2 

2.78xl0”3 

12 

accept 

2 

R1=R2=R6 

3.02xl0“3 

13 

reject 

3 

Rl-R2*  Rg*R^ 

2.88xl0"3 

13 

accept 

4 

R1=R2  1  R6=R4*R3 

2.89xl0‘3 

14 

accept 

5 

^1~^2  »  R6=R4=R3=R5 

2. 99x1 0"3 

15 

accept 

For  N=3,750  the  hypothesis  of  market  homogeneity  would  be  rejected 
since  at  this  sample  size  (and  above)  the  market  segments  into  two.  The 
hypothesized  singleton  classification  in  (8)  is  also  rejected  in  favor  of 
joining  this  classification  together  with  one  in  I2.  However,  for  N  <  3,750 
this  classification  is  rejected  and  the  Ehrenberg-Goodhardt  conclusion  [13] 
of  a  single  homogeneous  market  continues  to  be  maintained  as  before. 

In  conclusion,  we  also  provide  the  p^j  estimates  corresponding  to 
the  hypothesis  of  problem  5  in  Table  6.  These  are  presented  in  Table  7.  Note 
that  these  values  are  consistent  with  the  indicated  segmentation  in  N  =  3,750. 
At  this  sample  size,  the  results  in  Tables  4  and  6  are  consistent  and  at 
N  <  1,430  they  are  also  consistent  with  the  concept  of  switching  equilibrium. 
At  sample  sizes  larger  than  N  *  1,430,  however,  the  hypothesis  of  switching 
equilibrium  is  rejected  while  the  hypothesis  of  market-share  equilibrium 
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continues  to  be  maintained  for  1,430  <  N  <  3,750.  Evidently,  the  two  concepts 
are  not  the  same.  Thus,  even  with  an  hypothesized  and  validated  market-share 
equilibrium  the  vulnerability  ratio  continues  to  provide  valuable  information 
by  the  way  it  summarizes  the  net  switches  in  and  out  and  by  the  way  it 
designates  the  more  vulnerable  products  by  reference  to  repeat  buying 
relative  to  market  share.1 


TABLE  7 

ESTIMATES  OF  SWITCHING/REPEAT  PROBABILITIES  IN  A  TWO-SEGMENT 
MARKET  UNDER  MARKET-SHARE  EQUILIBRIUM 


\  j 

1 

2 

3 

4 

5 

6 

1 

.239 

.076 

.033 

.024 

.012 

.012 

2 

.065 

.143 

.034 

.014 

.007 

.010 

3 

.035 

.031 

.054 

.010 

.007 

.006 

4 

.028 

.013 

.010 

.031 

.006 

.002 

5 

.017 

.006 

.004 

.007 

.016 

.001 

6 

.012 

.005 

.007 

.004 

.003 

.014 

1 


See  the  discussion  in  Section  4. 


CONCLUDING  REMARKS 


A  great  variety  of  other  possibilities  are  also  present,  of 
course,  but  the  above  examples,  with  accompanying  algorithms  and  theorems, 
at  least  provide  a  start  for  the  use  of  MDI  methods  in  marketing  where 
external  as  well  as  internal  constraints  (e.g.,  those  of  the  ordinary 
"rim  condition"  varieties)  are  to  be  considered.  In  the  opening  sec¬ 
tions  of  this  paper,  we  noted  how  statistically  formulated  versions  of  in¬ 
formation  theory  have  been  able  to  suppiy  a  basis  for  unifying  a  great 
body  of  statistical  methodology  (and  concepts).  As  should  be  clear  from 
our  examples,  the  use  of  classical  statistical  methods  on  testing  and 
estimation  problems  such  as  we  have  been  considering  would  be  onerous  and 
difficult  at  best.  The  MDI  approaches  handle  them  easily  and  also  open 
a  variety  of  additional  possibilities.  This  includes  possibilities  for 
further  testing  of  sharpened  hypotheses  when  one  is  willing  to  make  more 
specialized  assumptions  about  the  statistical  distributions  and/or  (as 
in  Sabavala  and  Morrison  [38])  the  underlying  probability  models  governing 
the  behavior  of  individual  consumers.  It  also  opens  the  possibility  of 
contacts  with  mathematical  programming  (and  its  duality  relations)  such 
as  are  provided  in  [17].  In  addition,  these  models  with  their  explicity 
formulated  constraints  can  provide  a  basis  for  sharpening  managerial 
insight  into  marketing  planning  and  control.  The  ultimate  managerial 
benefit  Is  that  approaches  along  these  lines  can  serve  to  direct  manage¬ 
ment  to  activities  that  ought  to  be  undertaken  and  this,  after  all,  is 
what  market  research  is  (or  should  be)  about  if  it  is  to  be  anything  more 
than  a  series  of  academic  exercises. 
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